Dataset statistics
| Number of variables | 25 |
|---|---|
| Number of observations | 3953 |
| Missing cells | 1047 |
| Missing cells (%) | 1.1% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 772.2 KiB |
| Average record size in memory | 200.0 B |
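The derived figures in this table can be checked from the raw counts. The following is a minimal sketch, assuming the missing-cell percentage is taken over all cells (variables × observations) and the average record size is total memory divided by the number of observations; the variable names are illustrative and not part of the report.

```python
# Figures copied from the "Dataset statistics" table above.
n_variables = 25
n_observations = 3953
missing_cells = 1047
total_size_kib = 772.2

# Assumption: percentages are taken over every cell in the table.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

# Assumption: average record size is total memory split evenly per row.
avg_record_bytes = total_size_kib * 1024 / n_observations

print(f"Missing cells: {missing_pct:.1f}%")
print(f"Average record size: {avg_record_bytes:.1f} B")
```

Both results round to the values shown in the table (1.1% and 200.0 B).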
Variable types
| Categorical | 15 |
|---|---|
| Numeric | 10 |
Name has a high cardinality: 3682 distinct values | High cardinality |
Email ID has a high cardinality: 3373 distinct values | High cardinality |
Dt_Applied has a high cardinality: 3953 distinct values | High cardinality |
University has a high cardinality: 3140 distinct values | High cardinality |
Zip Code has a high cardinality: 615 distinct values | High cardinality |
Loan Amnt is highly correlated with Funded amnt inv and 2 other fields | High correlation |
Funded amnt inv is highly correlated with Loan Amnt and 2 other fields | High correlation |
INSTALLMENT is highly correlated with Loan Amnt and 2 other fields | High correlation |
Total Paymnt is highly correlated with Loan Amnt and 2 other fields | High correlation |
INSTALLMENT is highly correlated with Total Paymnt and 2 other fields | High correlation |
Sub Grade is highly correlated with TERM and 2 other fields | High correlation |
Total Paymnt is highly correlated with INSTALLMENT and 3 other fields | High correlation |
Funded amnt inv is highly correlated with INSTALLMENT and 4 other fields | High correlation |
Loan Amnt is highly correlated with INSTALLMENT and 4 other fields | High correlation |
Verification Status is highly correlated with Funded amnt inv and 1 other fields | High correlation |
TERM is highly correlated with Sub Grade and 5 other fields | High correlation |
GRADE is highly correlated with Sub Grade and 2 other fields | High correlation |
Int Rate is highly correlated with Sub Grade and 2 other fields | High correlation |
TERM is highly correlated with GRADE and 1 other fields | High correlation |
GRADE is highly correlated with TERM and 1 other fields | High correlation |
Sub Grade is highly correlated with TERM and 1 other fields | High correlation |
Name has 271 (6.9%) missing values | Missing |
Email ID has 580 (14.7%) missing values | Missing |
Gender has 78 (2.0%) missing values | Missing |
University has 118 (3.0%) missing values | Missing |
Name is uniformly distributed | Uniform |
Email ID is uniformly distributed | Uniform |
Dt_Applied is uniformly distributed | Uniform |
University is uniformly distributed | Uniform |
Dt_Applied has unique values | Unique |
Delinq 2Yrs has 3628 (91.8%) zeros | Zeros |
Inq Last 6Mths has 1822 (46.1%) zeros | Zeros |
Revol Bal has 42 (1.1%) zeros | Zeros |
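The percentages in the Missing and Zeros alerts follow directly from the counts and the 3953 observations. A short sketch, assuming each share is the count divided by the number of rows and rounded to one decimal place (the helper `share` is hypothetical):

```python
# Counts copied from the alerts above; the row count is "Number of observations".
N_ROWS = 3953
missing = {"Name": 271, "Email ID": 580, "Gender": 78, "University": 118}
zeros = {"Delinq 2Yrs": 3628, "Inq Last 6Mths": 1822, "Revol Bal": 42}


def share(count: int, total: int = N_ROWS) -> float:
    """Return count as a percentage of total, rounded to one decimal place."""
    return round(100 * count / total, 1)


missing_pct = {column: share(count) for column, count in missing.items()}
zeros_pct = {column: share(count) for column, count in zeros.items()}
total_missing_cells = sum(missing.values())

print(missing_pct)
print(zeros_pct)
print(total_missing_cells)
```

The four per-column missing counts add up to the 1047 missing cells reported under Dataset statistics.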
Reproduction
| Analysis started | 2021-06-30 14:59:56.903909 |
|---|---|
| Analysis finished | 2021-06-30 15:00:15.090285 |
| Duration | 18.19 seconds |
| Software version | pandas-profiling v3.0.0 |
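The duration can be recomputed from the two timestamps. A minimal sketch using only the standard library; the format string is an assumption based on how the timestamps are printed above.

```python
from datetime import datetime

# Assumed timestamp layout, matching the values printed in the table.
FMT = "%Y-%m-%d %H:%M:%S.%f"

started = datetime.strptime("2021-06-30 14:59:56.903909", FMT)
finished = datetime.strptime("2021-06-30 15:00:15.090285", FMT)

duration = (finished - started).total_seconds()
print(f"{duration:.2f} seconds")
```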
Name
| Distinct | 3682 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 271 |
| Missing (%) | 6.9% |
| Memory size | 31.0 KiB |
| Stormy Gerauld | 1 |
|---|---|
| Perkin Gomersall | 1 |
| Aviva Cody | 1 |
| Rusty Netley | 1 |
| Egbert Huegett | 1 |
| Other values (3677) | 3677 |
Length
| Max length | 23 |
|---|---|
| Median length | 14 |
| Mean length | 14.03286257 |
| Min length | 7 |
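The mean length is consistent with the other figures reported for this variable. A small sketch, assuming the mean is the total character count (51669, from the Characters and Unicode table) divided by the number of non-missing values:

```python
# Values taken from the report for this variable.
n_observations = 3953
n_missing = 271
total_characters = 51669

# Assumption: missing values are excluded from the length statistics.
n_present = n_observations - n_missing
mean_length = total_characters / n_present

print(n_present)
print(round(mean_length, 8))
```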
Characters and Unicode
| Total characters | 51669 |
|---|---|
| Distinct characters | 58 |
| Distinct categories | 6 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
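The category names used in this report match Unicode general categories. As a hedged illustration (not necessarily how the report itself computes them), Python's `unicodedata.category` can be used to tally categories for the five sample names listed in this section; the `CATEGORY_NAMES` mapping is a hand-written assumption covering only the codes that occur here.

```python
import unicodedata
from collections import Counter

# The five sample values shown for this variable.
sample = [
    "Calley Giron",
    "Linus Stud",
    "Lorelle Ambage",
    "Anna-diane Larrat",
    "Gill Ruske",
]

# Hand-written mapping from two-letter category codes to the report's labels.
CATEGORY_NAMES = {
    "Ll": "Lowercase Letter",
    "Lu": "Uppercase Letter",
    "Zs": "Space Separator",
    "Pd": "Dash Punctuation",
    "Po": "Other Punctuation",
    "Pe": "Close Punctuation",
}

# Count the general category of every character in every sample value.
category_counts = Counter(
    unicodedata.category(ch) for name in sample for ch in name
)
named_counts = {CATEGORY_NAMES[code]: n for code, n in category_counts.items()}
print(named_counts)
```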
Unique
| Unique | 3682 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Calley Giron |
|---|---|
| 2nd row | Linus Stud |
| 3rd row | Lorelle Ambage |
| 4th row | Anna-diane Larrat |
| 5th row | Gill Ruske |
Common Values
| Value | Count | Frequency (%) |
| Stormy Gerauld | 1 | < 0.1% |
| Perkin Gomersall | 1 | < 0.1% |
| Aviva Cody | 1 | < 0.1% |
| Rusty Netley | 1 | < 0.1% |
| Egbert Huegett | 1 | < 0.1% |
| Elvis Farden | 1 | < 0.1% |
| Ilka Exer | 1 | < 0.1% |
| Starlin Aidler | 1 | < 0.1% |
| Clerissa Branchett | 1 | < 0.1% |
| Dicky McGunley | 1 | < 0.1% |
| Other values (3672) | 3672 | 92.9% |
| (Missing) | 271 | 6.9% |
Words
| Value | Count | Frequency (%) |
| de | 20 | 0.3% |
| le | 6 | 0.1% |
| dee | 5 | 0.1% |
| van | 5 | 0.1% |
| kerry | 4 | 0.1% |
| salomo | 4 | 0.1% |
| derril | 4 | 0.1% |
| gill | 4 | 0.1% |
| glad | 4 | 0.1% |
| imogen | 4 | 0.1% |
| Other values (6460) | 7355 | 99.2% |
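The entries in this table are lowercased tokens rather than full values. A minimal sketch of one way to produce such counts, assuming values are lowercased and split on whitespace (an assumption about the tokenization, shown here on the five sample names from this section):

```python
from collections import Counter

sample = [
    "Calley Giron",
    "Linus Stud",
    "Lorelle Ambage",
    "Anna-diane Larrat",
    "Gill Ruske",
]

# Assumption: lowercase each value, then split on whitespace.
word_counts = Counter(
    word for name in sample for word in name.lower().split()
)
print(word_counts.most_common(3))
```

Under this assumption a hyphenated name such as "Anna-diane" stays a single token.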
Most occurring characters
| Value | Count | Frequency (%) |
| e | 5280 | 10.2% |
| a | 4505 | 8.7% |
| (space) | 3733 | 7.2% |
| n | 3525 | 6.8% |
| i | 3500 | 6.8% |
| r | 3444 | 6.7% |
| l | 3183 | 6.2% |
| o | 2704 | 5.2% |
| t | 2023 | 3.9% |
| s | 1723 | 3.3% |
| Other values (48) | 18049 | 34.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 40336 | 78.1% |
| Uppercase Letter | 7550 | 14.6% |
| Space Separator | 3733 | 7.2% |
| Other Punctuation | 35 | 0.1% |
| Dash Punctuation | 14 | < 0.1% |
| Close Punctuation | 1 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 678 | 9.0% |
| M | 600 | 7.9% |
| B | 596 | 7.9% |
| S | 573 | 7.6% |
| D | 479 | 6.3% |
| A | 451 | 6.0% |
| G | 443 | 5.9% |
| R | 400 | 5.3% |
| L | 395 | 5.2% |
| H | 326 | 4.3% |
| Other values (16) | 2609 | 34.6% |
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 5280 | 13.1% |
| a | 4505 | 11.2% |
| n | 3525 | 8.7% |
| i | 3500 | 8.7% |
| r | 3444 | 8.5% |
| l | 3183 | 7.9% |
| o | 2704 | 6.7% |
| t | 2023 | 5.0% |
| s | 1723 | 4.3% |
| d | 1395 | 3.5% |
| Other values (16) | 9054 | 22.4% |
Other Punctuation
| Value | Count | Frequency (%) |
| ' | 32 | 91.4% |
| . | 2 | 5.7% |
| ; | 1 | 2.9% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 3733 | 100.0% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 14 | 100.0% |
Close Punctuation
| Value | Count | Frequency (%) |
| ] | 1 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 47886 | 92.7% |
| Common | 3783 | 7.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 5280 | 11.0% |
| a | 4505 | 9.4% |
| n | 3525 | 7.4% |
| i | 3500 | 7.3% |
| r | 3444 | 7.2% |
| l | 3183 | 6.6% |
| o | 2704 | 5.6% |
| t | 2023 | 4.2% |
| s | 1723 | 3.6% |
| d | 1395 | 2.9% |
| Other values (42) | 16604 | 34.7% |
Common
| Value | Count | Frequency (%) |
| (space) | 3733 | 98.7% |
| ' | 32 | 0.8% |
| - | 14 | 0.4% |
| . | 2 | 0.1% |
| ] | 1 | < 0.1% |
| ; | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 51669 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 5280 | 10.2% |
| a | 4505 | 8.7% |
| (space) | 3733 | 7.2% |
| n | 3525 | 6.8% |
| i | 3500 | 6.8% |
| r | 3444 | 6.7% |
| l | 3183 | 6.2% |
| o | 2704 | 5.2% |
| t | 2023 | 3.9% |
| s | 1723 | 3.3% |
| Other values (48) | 18049 | 34.9% |
Email ID
| Distinct | 3373 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 580 |
| Missing (%) | 14.7% |
| Memory size | 31.0 KiB |
| lstud1@washington.edu | 1 |
|---|---|
| tveregan4t@tamu.edu | 1 |
| tdilnotfp@aol.com | 1 |
| ndoughty3w@google.com.br | 1 |
| ebrownfieldn0@php.net | 1 |
| Other values (3368) | 3368 |
Length
| Max length | 35 |
|---|---|
| Median length | 22 |
| Mean length | 21.83308627 |
| Min length | 11 |
Characters and Unicode
| Total characters | 73643 |
|---|---|
| Distinct characters | 39 |
| Distinct categories | 4 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 3373 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | cgiron0@ehow.com |
|---|---|
| 2nd row | lstud1@washington.edu |
| 3rd row | lambage2@wix.com |
| 4th row | alarrat3@economist.com |
| 5th row | emacfaul5@theatlantic.com |
Common Values
| Value | Count | Frequency (%) |
| lstud1@washington.edu | 1 | < 0.1% |
| tveregan4t@tamu.edu | 1 | < 0.1% |
| tdilnotfp@aol.com | 1 | < 0.1% |
| ndoughty3w@google.com.br | 1 | < 0.1% |
| ebrownfieldn0@php.net | 1 | < 0.1% |
| jfleetwood1m@google.com | 1 | < 0.1% |
| knormington1@amazon.co.uk | 1 | < 0.1% |
| cmillmoebf@arizona.edu | 1 | < 0.1% |
| lmccahey5s@addthis.com | 1 | < 0.1% |
| lclauspo@networksolutions.com | 1 | < 0.1% |
| Other values (3363) | 3363 | 85.1% |
| (Missing) | 580 | 14.7% |
Words
| Value | Count | Frequency (%) |
| falywen26@columbia.edu | 1 | < 0.1% |
| rlagem3@fda.gov | 1 | < 0.1% |
| tdilnotfp@aol.com | 1 | < 0.1% |
| ndoughty3w@google.com.br | 1 | < 0.1% |
| ebrownfieldn0@php.net | 1 | < 0.1% |
| jfleetwood1m@google.com | 1 | < 0.1% |
| knormington1@amazon.co.uk | 1 | < 0.1% |
| cmillmoebf@arizona.edu | 1 | < 0.1% |
| lmccahey5s@addthis.com | 1 | < 0.1% |
| lclauspo@networksolutions.com | 1 | < 0.1% |
| Other values (3363) | 3363 | 99.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 6259 | 8.5% |
| e | 5765 | 7.8% |
| c | 4676 | 6.3% |
| a | 4491 | 6.1% |
| m | 4076 | 5.5% |
| . | 3699 | 5.0% |
| r | 3657 | 5.0% |
| n | 3612 | 4.9% |
| i | 3589 | 4.9% |
| @ | 3373 | 4.6% |
| Other values (29) | 30446 | 41.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 64215 | 87.2% |
| Other Punctuation | 7072 | 9.6% |
| Decimal Number | 2273 | 3.1% |
| Dash Punctuation | 83 | 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 6259 | 9.7% |
| e | 5765 | 9.0% |
| c | 4676 | 7.3% |
| a | 4491 | 7.0% |
| m | 4076 | 6.3% |
| r | 3657 | 5.7% |
| n | 3612 | 5.6% |
| i | 3589 | 5.6% |
| l | 3340 | 5.2% |
| s | 3154 | 4.9% |
| Other values (16) | 21596 | 33.6% |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 265 | 11.7% |
| 1 | 260 | 11.4% |
| 3 | 260 | 11.4% |
| 6 | 243 | 10.7% |
| 4 | 243 | 10.7% |
| 8 | 239 | 10.5% |
| 5 | 227 | 10.0% |
| 9 | 215 | 9.5% |
| 7 | 210 | 9.2% |
| 0 | 111 | 4.9% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 3699 | 52.3% |
| @ | 3373 | 47.7% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 83 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 64215 | 87.2% |
| Common | 9428 | 12.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 6259 | 9.7% |
| e | 5765 | 9.0% |
| c | 4676 | 7.3% |
| a | 4491 | 7.0% |
| m | 4076 | 6.3% |
| r | 3657 | 5.7% |
| n | 3612 | 5.6% |
| i | 3589 | 5.6% |
| l | 3340 | 5.2% |
| s | 3154 | 4.9% |
| Other values (16) | 21596 | 33.6% |
Common
| Value | Count | Frequency (%) |
| . | 3699 | 39.2% |
| @ | 3373 | 35.8% |
| 2 | 265 | 2.8% |
| 1 | 260 | 2.8% |
| 3 | 260 | 2.8% |
| 6 | 243 | 2.6% |
| 4 | 243 | 2.6% |
| 8 | 239 | 2.5% |
| 5 | 227 | 2.4% |
| 9 | 215 | 2.3% |
| Other values (3) | 404 | 4.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 73643 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| o | 6259 | 8.5% |
| e | 5765 | 7.8% |
| c | 4676 | 6.3% |
| a | 4491 | 6.1% |
| m | 4076 | 5.5% |
| . | 3699 | 5.0% |
| r | 3657 | 5.0% |
| n | 3612 | 4.9% |
| i | 3589 | 4.9% |
| @ | 3373 | 4.6% |
| Other values (29) | 30446 | 41.3% |
Gender
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 78 |
| Missing (%) | 2.0% |
| Memory size | 31.0 KiB |
| Male | 1970 |
|---|---|
| Female | 1905 |
Length
| Max length | 6 |
|---|---|
| Median length | 4 |
| Mean length | 4.983225806 |
| Min length | 4 |
Characters and Unicode
| Total characters | 19310 |
|---|---|
| Distinct characters | 6 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Female |
|---|---|
| 2nd row | Male |
| 3rd row | Female |
| 4th row | Female |
| 5th row | Female |
Common Values
| Value | Count | Frequency (%) |
| Male | 1970 | 49.8% |
| Female | 1905 | 48.2% |
| (Missing) | 78 | 2.0% |
Words
| Value | Count | Frequency (%) |
| male | 1970 | 50.8% |
| female | 1905 | 49.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 5780 | 29.9% |
| a | 3875 | 20.1% |
| l | 3875 | 20.1% |
| M | 1970 | 10.2% |
| F | 1905 | 9.9% |
| m | 1905 | 9.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 15435 | 79.9% |
| Uppercase Letter | 3875 | 20.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 5780 | 37.4% |
| a | 3875 | 25.1% |
| l | 3875 | 25.1% |
| m | 1905 | 12.3% |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 1970 | 50.8% |
| F | 1905 | 49.2% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 19310 | 100.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 5780 | 29.9% |
| a | 3875 | 20.1% |
| l | 3875 | 20.1% |
| M | 1970 | 10.2% |
| F | 1905 | 9.9% |
| m | 1905 | 9.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 19310 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 5780 | 29.9% |
| a | 3875 | 20.1% |
| l | 3875 | 20.1% |
| M | 1970 | 10.2% |
| F | 1905 | 9.9% |
| m | 1905 | 9.9% |
Dt_Applied
| Distinct | 3953 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 31.0 KiB |
| 07/03/91 | 1 |
|---|---|
| 14/02/89 | 1 |
| 19/11/84 | 1 |
| 14/10/83 | 1 |
| 15/05/83 | 1 |
| Other values (3948) | 3948 |
Length
| Max length | 8 |
|---|---|
| Median length | 8 |
| Mean length | 8 |
| Min length | 8 |
Characters and Unicode
| Total characters | 31624 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 3953 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 01/01/81 |
|---|---|
| 2nd row | 02/01/81 |
| 3rd row | 03/01/81 |
| 4th row | 04/01/81 |
| 5th row | 05/01/81 |
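Because this column is stored as text, every value is distinct and the variable is profiled as a high-cardinality categorical. Values such as 14/02/89 and 19/11/84 have a first field above 12, so the layout must be day/month/two-digit year. A minimal sketch of parsing it with the standard library; the helper name `parse_applied` is hypothetical.

```python
from datetime import date, datetime

# Values taken from the Sample and Common Values tables for this variable.
samples = ["01/01/81", "02/01/81", "14/02/89", "19/11/84", "07/03/91"]


def parse_applied(value: str) -> date:
    """Parse a dd/mm/yy string into a date (day first, two-digit year)."""
    return datetime.strptime(value, "%d/%m/%y").date()


parsed = [parse_applied(value) for value in samples]
for raw, day in zip(samples, parsed):
    print(raw, "->", day.isoformat())
```

Python's `%y` maps 69 to 99 onto 1969 to 1999, so the two-digit years shown here resolve to the 1980s and 1990s.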
Common Values
| Value | Count | Frequency (%) |
| 07/03/91 | 1 | < 0.1% |
| 14/02/89 | 1 | < 0.1% |
| 19/11/84 | 1 | < 0.1% |
| 14/10/83 | 1 | < 0.1% |
| 15/05/83 | 1 | < 0.1% |
| 06/05/87 | 1 | < 0.1% |
| 13/11/84 | 1 | < 0.1% |
| 05/11/89 | 1 | < 0.1% |
| 11/08/88 | 1 | < 0.1% |
| 11/10/91 | 1 | < 0.1% |
| Other values (3943) | 3943 | 99.7% |
Words
| Value | Count | Frequency (%) |
| 07/03/91 | 1 | < 0.1% |
| 14/02/89 | 1 | < 0.1% |
| 19/11/84 | 1 | < 0.1% |
| 14/10/83 | 1 | < 0.1% |
| 15/05/83 | 1 | < 0.1% |
| 06/05/87 | 1 | < 0.1% |
| 13/11/84 | 1 | < 0.1% |
| 05/11/89 | 1 | < 0.1% |
| 11/08/88 | 1 | < 0.1% |
| 11/10/91 | 1 | < 0.1% |
| Other values (3943) | 3943 | 99.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| / | 7906 | 25.0% |
| 0 | 5259 | 16.6% |
| 8 | 4381 | 13.9% |
| 1 | 4020 | 12.7% |
| 2 | 2664 | 8.4% |
| 9 | 1744 | 5.5% |
| 3 | 1288 | 4.1% |
| 5 | 1096 | 3.5% |
| 7 | 1096 | 3.5% |
| 4 | 1085 | 3.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 23718 | 75.0% |
| Other Punctuation | 7906 | 25.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 5259 | 22.2% |
| 8 | 4381 | 18.5% |
| 1 | 4020 | 16.9% |
| 2 | 2664 | 11.2% |
| 9 | 1744 | 7.4% |
| 3 | 1288 | 5.4% |
| 5 | 1096 | 4.6% |
| 7 | 1096 | 4.6% |
| 4 | 1085 | 4.6% |
| 6 | 1085 | 4.6% |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 7906 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 31624 | 100.0% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| / | 7906 | 25.0% |
| 0 | 5259 | 16.6% |
| 8 | 4381 | 13.9% |
| 1 | 4020 | 12.7% |
| 2 | 2664 | 8.4% |
| 9 | 1744 | 5.5% |
| 3 | 1288 | 4.1% |
| 5 | 1096 | 3.5% |
| 7 | 1096 | 3.5% |
| 4 | 1085 | 3.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 31624 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| / | 7906 | 25.0% |
| 0 | 5259 | 16.6% |
| 8 | 4381 | 13.9% |
| 1 | 4020 | 12.7% |
| 2 | 2664 | 8.4% |
| 9 | 1744 | 5.5% |
| 3 | 1288 | 4.1% |
| 5 | 1096 | 3.5% |
| 7 | 1096 | 3.5% |
| 4 | 1085 | 3.4% |
University
| Distinct | 3140 |
|---|---|
| Distinct (%) | 81.9% |
| Missing | 118 |
| Missing (%) | 3.0% |
| Memory size | 31.0 KiB |
| Tampere Polytechnic | 4 |
|---|---|
| Fukuoka Institute of Technology | 4 |
| Jiangxi University of Traditional Chinese Medicine | 4 |
| Arab Open University | 4 |
| Abant Izzet Baysal University | 4 |
| Other values (3135) | 3815 |
Length
| Max length | 114 |
|---|---|
| Median length | 29 |
| Mean length | 30.49152542 |
| Min length | 11 |
Characters and Unicode
| Total characters | 116935 |
|---|---|
| Distinct characters | 98 |
| Distinct categories | 11 |
| Distinct scripts | 2 |
| Distinct blocks | 4 |
Unique
| Unique | 2542 |
|---|---|
| Unique (%) | 66.3% |
Sample
| 1st row | Warner Southern College |
|---|---|
| 2nd row | Shri Lal Bahadur Shastri Rashtriya Sanskrit Vidyapeetha |
| 3rd row | Technische Universität Bergakademie Freiberg |
| 4th row | Divine Word College of Legazpi |
| 5th row | East China Jiao Tong University |
Common Values
| Value | Count | Frequency (%) |
| Tampere Polytechnic | 4 | 0.1% |
| Fukuoka Institute of Technology | 4 | 0.1% |
| Jiangxi University of Traditional Chinese Medicine | 4 | 0.1% |
| Arab Open University | 4 | 0.1% |
| Abant Izzet Baysal University | 4 | 0.1% |
| Phillips Graduate Institute | 4 | 0.1% |
| Stavropol State Technical University | 4 | 0.1% |
| Universidad de Congreso | 4 | 0.1% |
| Universidad Valle del Momboy | 4 | 0.1% |
| Carlow College | 4 | 0.1% |
| Other values (3130) | 3795 | 96.0% |
| (Missing) | 118 | 3.0% |
Words
| Value | Count | Frequency (%) |
| university | 2142 | 14.3% |
| of | 1144 | 7.6% |
| college | 544 | 3.6% |
| de | 397 | 2.7% |
| universidad | 307 | 2.0% |
| state | 274 | 1.8% |
| institute | 220 | 1.5% |
| and | 197 | 1.3% |
| technology | 195 | 1.3% |
|  | 113 | 0.8% |
| Other values (3562) | 9447 | 63.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 11198 | 9.6% |
| i | 10758 | 9.2% |
| e | 10464 | 8.9% |
| n | 8336 | 7.1% |
| a | 7981 | 6.8% |
| t | 6906 | 5.9% |
| r | 6300 | 5.4% |
| o | 6107 | 5.2% |
| s | 5789 | 5.0% |
| l | 4376 | 3.7% |
| Other values (88) | 38720 | 33.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 91706 | 78.4% |
| Uppercase Letter | 13156 | 11.3% |
| Space Separator | 11198 | 9.6% |
| Other Punctuation | 508 | 0.4% |
| Dash Punctuation | 186 | 0.2% |
| Open Punctuation | 79 | 0.1% |
| Close Punctuation | 79 | 0.1% |
| Decimal Number | 19 | < 0.1% |
| Control | 2 | < 0.1% |
| Initial Punctuation | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 10758 | 11.7% |
| e | 10464 | 11.4% |
| n | 8336 | 9.1% |
| a | 7981 | 8.7% |
| t | 6906 | 7.5% |
| r | 6300 | 6.9% |
| o | 6107 | 6.7% |
| s | 5789 | 6.3% |
| l | 4376 | 4.8% |
| y | 3155 | 3.4% |
| Other values (37) | 21534 | 23.5% |
Uppercase Letter
| Value | Count | Frequency (%) |
| U | 2851 | 21.7% |
| S | 1352 | 10.3% |
| C | 1236 | 9.4% |
| A | 857 | 6.5% |
| M | 785 | 6.0% |
| T | 762 | 5.8% |
| I | 708 | 5.4% |
| N | 522 | 4.0% |
| P | 492 | 3.7% |
| B | 422 | 3.2% |
| Other values (19) | 3169 | 24.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 7 | 36.8% |
| 7 | 3 | 15.8% |
| 9 | 2 | 10.5% |
| 4 | 2 | 10.5% |
| 5 | 2 | 10.5% |
| 3 | 1 | 5.3% |
| 2 | 1 | 5.3% |
| 8 | 1 | 5.3% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 200 | 39.4% |
| . | 106 | 20.9% |
| ' | 91 | 17.9% |
| " | 62 | 12.2% |
| & | 35 | 6.9% |
| / | 14 | 2.8% |
Control
| Value | Count | Frequency (%) |
| | 1 | |
| | 1 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 11198 | 100.0% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 186 | 100.0% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 79 | 100.0% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 79 | 100.0% |
Initial Punctuation
| Value | Count | Frequency (%) |
| “ | 1 | 100.0% |
Final Punctuation
| Value | Count | Frequency (%) |
| ” | 1 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 104862 | 89.7% |
| Common | 12073 | 10.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 10758 | 10.3% |
| e | 10464 | 10.0% |
| n | 8336 | 7.9% |
| a | 7981 | 7.6% |
| t | 6906 | 6.6% |
| r | 6300 | 6.0% |
| o | 6107 | 5.8% |
| s | 5789 | 5.5% |
| l | 4376 | 4.2% |
| y | 3155 | 3.0% |
| Other values (66) | 34690 | 33.1% |
Common
| Value | Count | Frequency (%) |
| (space) | 11198 | 92.8% |
| , | 200 | 1.7% |
| - | 186 | 1.5% |
| . | 106 | 0.9% |
| ' | 91 | 0.8% |
| ( | 79 | 0.7% |
| ) | 79 | 0.7% |
| " | 62 | 0.5% |
| & | 35 | 0.3% |
| / | 14 | 0.1% |
| Other values (12) | 23 | 0.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 116357 | 99.5% |
| Latin 1 Sup | 569 | 0.5% |
| Latin Ext A | 7 | < 0.1% |
| Punctuation | 2 | < 0.1% |
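The block labels in this table correspond to Unicode code point ranges. The sketch below is an illustration under stated assumptions, not the report's own implementation: it hard-codes only the four ranges that appear here (Basic Latin, Latin-1 Supplement, Latin Extended-A, General Punctuation) and applies them to one of the sample university names. `BLOCKS` and `block_of` are hypothetical names.

```python
from collections import Counter

# (first code point, last code point, label as printed in the report)
BLOCKS = [
    (0x0000, 0x007F, "ASCII"),        # Basic Latin
    (0x0080, 0x00FF, "Latin 1 Sup"),  # Latin-1 Supplement
    (0x0100, 0x017F, "Latin Ext A"),  # Latin Extended-A
    (0x2000, 0x206F, "Punctuation"),  # General Punctuation
]


def block_of(ch: str) -> str:
    """Return the block label for a single character, or 'Other'."""
    code = ord(ch)
    for first, last, label in BLOCKS:
        if first <= code <= last:
            return label
    return "Other"


# Third sample row of this variable; \u00e4 is the letter a with diaeresis.
text = "Technische Universit\u00e4t Bergakademie Freiberg"
block_counts = Counter(block_of(ch) for ch in text)
print(dict(block_counts))
```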
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 11198 | 9.6% |
| i | 10758 | 9.2% |
| e | 10464 | 9.0% |
| n | 8336 | 7.2% |
| a | 7981 | 6.9% |
| t | 6906 | 5.9% |
| r | 6300 | 5.4% |
| o | 6107 | 5.2% |
| s | 5789 | 5.0% |
| l | 4376 | 3.8% |
| Other values (60) | 38142 | 32.8% |
Latin 1 Sup
| Value | Count | Frequency (%) |
| é | 211 | 37.1% |
| ó | 90 | 15.8% |
| ä | 65 | 11.4% |
| ü | 59 | 10.4% |
| á | 43 | 7.6% |
| í | 33 | 5.8% |
| è | 11 | 1.9% |
| ñ | 9 | 1.6% |
| ç | 9 | 1.6% |
| ú | 8 | 1.4% |
| Other values (12) | 31 | 5.4% |
Punctuation
| Value | Count | Frequency (%) |
| “ | 1 | 50.0% |
| ” | 1 | 50.0% |
Latin Ext A
| Value | Count | Frequency (%) |
| č | 4 | 57.1% |
| Š | 1 | 14.3% |
| ı | 1 | 14.3% |
| ž | 1 | 14.3% |
Loan Amnt
| Distinct | 434 |
|---|---|
| Distinct (%) | 11.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 13017.49937 |
| Minimum | 1000 |
|---|---|
| Maximum | 35000 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 31.0 KiB |
Quantile statistics
| Minimum | 1000 |
|---|---|
| 5-th percentile | 3000 |
| Q1 | 6500 |
| median | 12000 |
| Q3 | 17625 |
| 95-th percentile | 30000 |
| Maximum | 35000 |
| Range | 34000 |
| Interquartile range (IQR) | 11125 |
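The last two rows of the quantile table are derived from the rows above them. A minimal check, using only the values printed in the table:

```python
# Quantiles copied from the table above.
quantiles = {"min": 1000, "q1": 6500, "median": 12000, "q3": 17625, "max": 35000}

value_range = quantiles["max"] - quantiles["min"]  # spread of the full data
iqr = quantiles["q3"] - quantiles["q1"]            # spread of the middle 50%

print(value_range, iqr)
```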
Descriptive statistics
| Standard deviation | 8155.330342 |
|---|---|
| Coefficient of variation (CV) | 0.6264897821 |
| Kurtosis | 0.3258532123 |
| Mean | 13017.49937 |
| Median Absolute Deviation (MAD) | 5500 |
| Skewness | 0.9233128761 |
| Sum | 51458175 |
| Variance | 66509412.98 |
| Monotonicity | Not monotonic |
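Several of the descriptive statistics are linked by simple identities, which gives a quick consistency check. The sketch below assumes the coefficient of variation is the standard deviation divided by the mean, the variance is the squared standard deviation, and the sum is the mean times the 3953 observations; small differences come from the rounding of the printed inputs.

```python
# Values copied from the tables for this variable.
n_observations = 3953
mean = 13017.49937
std = 8155.330342

cv = std / mean                 # coefficient of variation
variance = std ** 2             # variance as squared standard deviation
total = mean * n_observations   # sum recovered from the mean

print(f"CV: {cv:.10f}")
print(f"Variance: {variance:.2f}")
print(f"Sum: {total:.0f}")
```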
| Value | Count | Frequency (%) |
| 12000 | 315 | 8.0% |
| 10000 | 259 | 6.6% |
| 15000 | 190 | 4.8% |
| 20000 | 174 | 4.4% |
| 6000 | 165 | 4.2% |
| 5000 | 153 | 3.9% |
| 35000 | 143 | 3.6% |
| 8000 | 124 | 3.1% |
| 16000 | 99 | 2.5% |
| 25000 | 97 | 2.5% |
| Other values (424) | 2234 | 56.5% |
| Value | Count | Frequency (%) |
| 1000 | 21 | 0.5% |
| 1100 | 1 | < 0.1% |
| 1200 | 9 | 0.2% |
| 1300 | 2 | 0.1% |
| 1325 | 1 | < 0.1% |
| 1400 | 3 | 0.1% |
| 1450 | 2 | 0.1% |
| 1500 | 11 | 0.3% |
| 1600 | 6 | 0.2% |
| 1700 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 35000 | 143 | 3.6% |
| 34475 | 1 | < 0.1% |
| 34000 | 2 | 0.1% |
| 33950 | 1 | < 0.1% |
| 33600 | 2 | 0.1% |
| 33425 | 1 | < 0.1% |
| 33000 | 1 | < 0.1% |
| 32875 | 1 | < 0.1% |
| 32275 | 1 | < 0.1% |
| 32000 | 3 | 0.1% |
Funded amnt inv
Real number (ℝ≥0)
High correlation
| Distinct | 828 |
|---|---|
| Distinct (%) | 20.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 12809.79216 |
| Minimum | 750 |
|---|---|
| Maximum | 35000 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 31.0 KiB |
Quantile statistics
| Minimum | 750 |
|---|---|
| 5-th percentile | 3000 |
| Q1 | 6500 |
| median | 11775 |
| Q3 | 17000 |
| 95-th percentile | 29735 |
| Maximum | 35000 |
| Range | 34250 |
| Interquartile range (IQR) | 10500 |
Descriptive statistics
| Standard deviation | 7935.907682 |
|---|---|
| Coefficient of variation (CV) | 0.619518848 |
| Kurtosis | 0.3951370723 |
| Mean | 12809.79216 |
| Median Absolute Deviation (MAD) | 5275 |
| Skewness | 0.9263171893 |
| Sum | 50637108.41 |
| Variance | 62978630.74 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 12000 | 249 | 6.3% |
| 10000 | 222 | 5.6% |
| 6000 | 153 | 3.9% |
| 5000 | 143 | 3.6% |
| 15000 | 139 | 3.5% |
| 8000 | 113 | 2.9% |
| 7000 | 87 | 2.2% |
| 3000 | 74 | 1.9% |
| 20000 | 72 | 1.8% |
| 14000 | 64 | 1.6% |
| Other values (818) | 2637 | 66.7% |
| Value | Count | Frequency (%) |
| 750 | 1 | < 0.1% |
| 1000 | 20 | 0.5% |
| 1100 | 1 | < 0.1% |
| 1200 | 9 | 0.2% |
| 1300 | 2 | 0.1% |
| 1325 | 1 | < 0.1% |
| 1400 | 3 | 0.1% |
| 1450 | 2 | 0.1% |
| 1500 | 11 | 0.3% |
| 1600 | 6 | 0.2% |
| Value | Count | Frequency (%) |
| 35000 | 37 | 0.9% |
| 34997.35245 | 1 | < 0.1% |
| 34993.65539 | 1 | < 0.1% |
| 34987.98452 | 1 | < 0.1% |
| 34987.27101 | 1 | < 0.1% |
| 34977.34674 | 1 | < 0.1% |
| 34975.81636 | 1 | < 0.1% |
| 34975 | 14 | 0.4% |
| 34972.8295 | 1 | < 0.1% |
| 34972.50393 | 1 | < 0.1% |
TERM
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 31.0 KiB |
| 36 months | 2687 |
|---|---|
| 60 months | 1266 |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 10 |
| Min length | 10 |
Characters and Unicode
| Total characters | 39530 |
|---|---|
| Distinct characters | 10 |
| Distinct categories | 3 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 36 months |
|---|---|
| 2nd row | 60 months |
| 3rd row | 36 months |
| 4th row | 36 months |
| 5th row | 60 months |
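Every value has length 10 and contains two spaces, although "36 months" as displayed is only nine characters, which suggests a stray space in the raw strings. A minimal sketch of converting the term to an integer number of months in a way that tolerates extra whitespace; the leading space in the sample strings is an assumption, and `term_in_months` is a hypothetical helper.

```python
# Assumed raw form of the values: a stray leading space before the number.
samples = [" 36 months", " 60 months", " 36 months", " 36 months", " 60 months"]


def term_in_months(value: str) -> int:
    """Return the leading number of a term string such as ' 36 months'."""
    # str.split() with no arguments ignores leading and repeated whitespace.
    return int(value.split()[0])


terms = [term_in_months(value) for value in samples]
print(terms)
```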
Common Values
| Value | Count | Frequency (%) |
| 36 months | 2687 | 68.0% |
| 60 months | 1266 | 32.0% |
Words
| Value | Count | Frequency (%) |
| months | 3953 | 50.0% |
| 36 | 2687 | 34.0% |
| 60 | 1266 | 16.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 7906 | 20.0% |
| 6 | 3953 | 10.0% |
| m | 3953 | 10.0% |
| o | 3953 | 10.0% |
| n | 3953 | 10.0% |
| t | 3953 | 10.0% |
| h | 3953 | 10.0% |
| s | 3953 | 10.0% |
| 3 | 2687 | 6.8% |
| 0 | 1266 | 3.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 23718 | 60.0% |
| Space Separator | 7906 | 20.0% |
| Decimal Number | 7906 | 20.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| m | 3953 | 16.7% |
| o | 3953 | 16.7% |
| n | 3953 | 16.7% |
| t | 3953 | 16.7% |
| h | 3953 | 16.7% |
| s | 3953 | 16.7% |
Decimal Number
| Value | Count | Frequency (%) |
| 6 | 3953 | 50.0% |
| 3 | 2687 | 34.0% |
| 0 | 1266 | 16.0% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 7906 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 23718 | 60.0% |
| Common | 15812 | 40.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| m | 3953 | 16.7% |
| o | 3953 | 16.7% |
| n | 3953 | 16.7% |
| t | 3953 | 16.7% |
| h | 3953 | 16.7% |
| s | 3953 | 16.7% |
Common
| Value | Count | Frequency (%) |
| (space) | 7906 | 50.0% |
| 6 | 3953 | 25.0% |
| 3 | 2687 | 17.0% |
| 0 | 1266 | 8.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 39530 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 7906 | 20.0% |
| 6 | 3953 | 10.0% |
| m | 3953 | 10.0% |
| o | 3953 | 10.0% |
| n | 3953 | 10.0% |
| t | 3953 | 10.0% |
| h | 3953 | 10.0% |
| s | 3953 | 10.0% |
| 3 | 2687 | 6.8% |
| 0 | 1266 | 3.2% |
Int Rate
| Distinct | 35 |
|---|---|
| Distinct (%) | 0.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.1296908677 |
| Minimum | 0.06 |
|---|---|
| Maximum | 0.241 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 31.0 KiB |
Quantile statistics
| Minimum | 0.06 |
|---|---|
| 5-th percentile | 0.066 |
| Q1 | 0.099 |
| median | 0.127 |
| Q3 | 0.16 |
| 95-th percentile | 0.203 |
| Maximum | 0.241 |
| Range | 0.181 |
| Interquartile range (IQR) | 0.061 |
Descriptive statistics
| Standard deviation | 0.04160931484 |
|---|---|
| Coefficient of variation (CV) | 0.3208345782 |
| Kurtosis | -0.6951924625 |
| Mean | 0.1296908677 |
| Median Absolute Deviation (MAD) | 0.033 |
| Skewness | 0.226416223 |
| Sum | 512.668 |
| Variance | 0.001731335081 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.117 | 324 | 8.2% |
| 0.079 | 259 | 6.6% |
| 0.127 | 259 | 6.6% |
| 0.124 | 254 | 6.4% |
| 0.135 | 231 | 5.8% |
| 0.143 | 226 | 5.7% |
| 0.107 | 213 | 5.4% |
| 0.099 | 211 | 5.3% |
| 0.089 | 198 | 5.0% |
| 0.06 | 160 | 4.0% |
| Other values (25) | 1618 | 40.9% |
| Value | Count | Frequency (%) |
| 0.06 | 160 | 4.0% |
| 0.066 | 156 | 3.9% |
| 0.075 | 137 | 3.5% |
| 0.079 | 259 | 6.6% |
| 0.089 | 198 | 5.0% |
| 0.099 | 211 | 5.3% |
| 0.107 | 213 | 5.4% |
| 0.117 | 324 | 8.2% |
| 0.124 | 254 | 6.4% |
| 0.127 | 259 | 6.6% |
| Value | Count | Frequency (%) |
| 0.241 | 2 | 0.1% |
| 0.239 | 6 | 0.2% |
| 0.235 | 6 | 0.2% |
| 0.231 | 4 | 0.1% |
| 0.227 | 6 | 0.2% |
| 0.224 | 15 | 0.4% |
| 0.221 | 19 | 0.5% |
| 0.217 | 24 | 0.6% |
| 0.213 | 28 | 0.7% |
| 0.209 | 39 | 1.0% |
INSTALLMENT
| Distinct | 1923 |
|---|---|
| Distinct (%) | 48.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 375.2073362 |
| Minimum | 32.23 |
|---|---|
| Maximum | 1283.5 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 31.0 KiB |
Quantile statistics
| Minimum | 32.23 |
|---|---|
| 5-th percentile | 93.88 |
| Q1 | 205.86 |
| median | 336 |
| Q3 | 494.59 |
| 95-th percentile | 813.626 |
| Maximum | 1283.5 |
| Range | 1251.27 |
| Interquartile range (IQR) | 288.73 |
Descriptive statistics
| Standard deviation | 220.261152 |
|---|---|
| Coefficient of variation (CV) | 0.5870385006 |
| Kurtosis | 0.8900854243 |
| Mean | 375.2073362 |
| Median Absolute Deviation (MAD) | 140.06 |
| Skewness | 0.9837168213 |
| Sum | 1483194.6 |
| Variance | 48514.9751 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 330.76 | 27 | 0.7% |
| 396.92 | 25 | 0.6% |
| 325.74 | 22 | 0.6% |
| 386.7 | 21 | 0.5% |
| 339.31 | 20 | 0.5% |
| 334.16 | 19 | 0.5% |
| 322.25 | 19 | 0.5% |
| 343.09 | 18 | 0.5% |
| 190.52 | 18 | 0.5% |
| 368.45 | 17 | 0.4% |
| Other values (1913) | 3747 | 94.8% |
| Value | Count | Frequency (%) |
| 32.23 | 1 | < 0.1% |
| 32.58 | 2 | 0.1% |
| 33.08 | 2 | 0.1% |
| 33.55 | 1 | < 0.1% |
| 33.94 | 3 | 0.1% |
| 34.31 | 1 | < 0.1% |
| 34.5 | 3 | 0.1% |
| 34.8 | 2 | 0.1% |
| 35.14 | 1 | < 0.1% |
| 35.31 | 4 | 0.1% |
| Value | Count | Frequency (%) |
| 1283.5 | 1 | < 0.1% |
| 1276.6 | 1 | < 0.1% |
| 1269.73 | 1 | < 0.1% |
| 1243.85 | 1 | < 0.1% |
| 1222.03 | 1 | < 0.1% |
| 1203.66 | 1 | < 0.1% |
| 1200.82 | 2 | 0.1% |
| 1157.66 | 1 | < 0.1% |
| 1142.94 | 1 | < 0.1% |
| 1140.07 | 5 | 0.1% |
GRADE
| Distinct | 7 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 31.0 KiB |
| B | 1262 |
|---|---|
| A | 908 |
| C | 811 |
| D | 510 |
| E | 313 |
| Other values (2) | 149 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 3953 |
|---|---|
| Distinct characters | 7 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | B |
|---|---|
| 2nd row | C |
| 3rd row | C |
| 4th row | C |
| 5th row | B |
Common Values
| Value | Count | Frequency (%) |
| B | 1262 | 31.9% |
| A | 908 | 23.0% |
| C | 811 | 20.5% |
| D | 510 | 12.9% |
| E | 313 | 7.9% |
| F | 125 | 3.2% |
| G | 24 | 0.6% |
Words
| Value | Count | Frequency (%) |
| b | 1262 | 31.9% |
| a | 908 | 23.0% |
| c | 811 | 20.5% |
| d | 510 | 12.9% |
| e | 313 | 7.9% |
| f | 125 | 3.2% |
| g | 24 | 0.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| B | 1262 | 31.9% |
| A | 908 | 23.0% |
| C | 811 | 20.5% |
| D | 510 | 12.9% |
| E | 313 | 7.9% |
| F | 125 | 3.2% |
| G | 24 | 0.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 3953 | 100.0% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| B | 1262 | 31.9% |
| A | 908 | 23.0% |
| C | 811 | 20.5% |
| D | 510 | 12.9% |
| E | 313 | 7.9% |
| F | 125 | 3.2% |
| G | 24 | 0.6% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3953 | 100.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| B | 1262 | 31.9% |
| A | 908 | 23.0% |
| C | 811 | 20.5% |
| D | 510 | 12.9% |
| E | 313 | 7.9% |
| F | 125 | 3.2% |
| G | 24 | 0.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3953 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| B | 1262 | 31.9% |
| A | 908 | 23.0% |
| C | 811 | 20.5% |
| D | 510 | 12.9% |
| E | 313 | 7.9% |
| F | 125 | 3.2% |
| G | 24 | 0.6% |
Sub Grade
Categorical
| Distinct | 35 |
|---|---|
| Distinct (%) | 0.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 31.0 KiB |
| B3 | |
|---|---|
| B5 | 260 |
| A4 | 259 |
| B4 | 254 |
| C1 | 231 |
| Other values (30) |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Characters and Unicode
| Total characters | 7906 |
|---|---|
| Distinct characters | 12 |
| Distinct categories | 2 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | B2 |
|---|---|
| 2nd row | C4 |
| 3rd row | C5 |
| 4th row | C1 |
| 5th row | B5 |
Common Values
| Value | Count | Frequency (%) |
| B3 | 324 | 8.2% |
| B5 | 260 | 6.6% |
| A4 | 259 | 6.6% |
| B4 | 254 | 6.4% |
| C1 | 231 | 5.8% |
| C2 | 227 | 5.7% |
| B2 | 213 | 5.4% |
| B1 | 211 | 5.3% |
| A5 | 198 | 5.0% |
| A1 | 158 | 4.0% |
| Other values (25) | 1618 | 40.9% |
Length
| Value | Count | Frequency (%) |
| b3 | 324 | 8.2% |
| b5 | 260 | 6.6% |
| a4 | 259 | 6.6% |
| b4 | 254 | 6.4% |
| c1 | 231 | 5.8% |
| c2 | 227 | 5.7% |
| b2 | 213 | 5.4% |
| b1 | 211 | 5.3% |
| a5 | 198 | 5.0% |
| a1 | 158 | 4.0% |
| Other values (25) | 1618 |
Most occurring characters
| Value | Count | Frequency (%) |
| B | 1262 | |
| A | 908 | |
| 2 | 832 | |
| C | 811 | |
| 4 | 806 | |
| 3 | 803 | |
| 1 | 797 | |
| 5 | 715 | |
| D | 510 | |
| E | 313 | 4.0% |
| Other values (2) | 149 | 1.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 3953 | |
| Decimal Number | 3953 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| B | 1262 | |
| A | 908 | |
| C | 811 | |
| D | 510 | |
| E | 313 | 7.9% |
| F | 125 | 3.2% |
| G | 24 | 0.6% |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 832 | |
| 4 | 806 | |
| 3 | 803 | |
| 1 | 797 | |
| 5 | 715 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3953 | |
| Common | 3953 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| B | 1262 | |
| A | 908 | |
| C | 811 | |
| D | 510 | |
| E | 313 | 7.9% |
| F | 125 | 3.2% |
| G | 24 | 0.6% |
Common
| Value | Count | Frequency (%) |
| 2 | 832 | |
| 4 | 806 | |
| 3 | 803 | |
| 1 | 797 | |
| 5 | 715 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 7906 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| B | 1262 | |
| A | 908 | |
| 2 | 832 | |
| C | 811 | |
| 4 | 806 | |
| 3 | 803 | |
| 1 | 797 | |
| 5 | 715 | |
| D | 510 | |
| E | 313 | 4.0% |
| Other values (2) | 149 | 1.9% |
Home Ownership
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 31.0 KiB |
| RENT | |
|---|---|
| MORTGAGE | |
| OWN |
Length
| Max length | 8 |
|---|---|
| Median length | 4 |
| Mean length | 5.521123198 |
| Min length | 3 |
Characters and Unicode
| Total characters | 21825 |
|---|---|
| Distinct characters | 9 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | RENT |
|---|---|
| 2nd row | RENT |
| 3rd row | RENT |
| 4th row | RENT |
| 5th row | RENT |
Common Values
| Value | Count | Frequency (%) |
| RENT | 2081 | 52.6% |
| MORTGAGE | 1577 | 39.9% |
| OWN | 295 | 7.5% |
Length
Pie chart
| Value | Count | Frequency (%) |
| rent | 2081 | |
| mortgage | 1577 | |
| own | 295 | 7.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| R | 3658 | |
| E | 3658 | |
| T | 3658 | |
| G | 3154 | |
| N | 2376 | |
| O | 1872 | |
| M | 1577 | |
| A | 1577 | |
| W | 295 | 1.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 21825 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| R | 3658 | |
| E | 3658 | |
| T | 3658 | |
| G | 3154 | |
| N | 2376 | |
| O | 1872 | |
| M | 1577 | |
| A | 1577 | |
| W | 295 | 1.4% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 21825 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| R | 3658 | |
| E | 3658 | |
| T | 3658 | |
| G | 3154 | |
| N | 2376 | |
| O | 1872 | |
| M | 1577 | |
| A | 1577 | |
| W | 295 | 1.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 21825 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| R | 3658 | |
| E | 3658 | |
| T | 3658 | |
| G | 3154 | |
| N | 2376 | |
| O | 1872 | |
| M | 1577 | |
| A | 1577 | |
| W | 295 | 1.4% |
Annual Inc
Real number (ℝ≥0)
| Distinct | 813 |
|---|---|
| Distinct (%) | 20.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 66175.97354 |
| Minimum | 8280 |
|---|---|
| Maximum | 550000 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 31.0 KiB |
Quantile statistics
| Minimum | 8280 |
|---|---|
| 5-th percentile | 25000 |
| Q1 | 40100 |
| median | 57000 |
| Q3 | 80000 |
| 95-th percentile | 135880 |
| Maximum | 550000 |
| Range | 541720 |
| Interquartile range (IQR) | 39900 |
Descriptive statistics
| Standard deviation | 40498.80417 |
|---|---|
| Coefficient of variation (CV) | 0.6119865264 |
| Kurtosis | 18.71426089 |
| Mean | 66175.97354 |
| Median Absolute Deviation (MAD) | 18000 |
| Skewness | 3.058200935 |
| Sum | 261593623.4 |
| Variance | 1640153139 |
| Monotonicity | Not monotonic |
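Several of the figures in the quantile and descriptive statistics tables above are derived from one another, which allows a quick consistency check. The sketch below only reuses values printed in those tables (the variable names are illustrative, not part of the report) and recomputes the range, the IQR, the coefficient of variation and the variance.

```python
# Values copied from the quantile and descriptive statistics tables above
minimum, maximum = 8280, 550000
q1, q3 = 40100, 80000
mean, std = 66175.97354, 40498.80417

value_range = maximum - minimum  # Range = Maximum - Minimum
iqr = q3 - q1                    # Interquartile range = Q3 - Q1
cv = std / mean                  # Coefficient of variation = std / mean
variance = std ** 2              # Variance = squared standard deviation

print("range:", value_range)
print("IQR:", iqr)
print("CV:", round(cv, 6))
print("variance:", round(variance))
```

The mean (66175.97) lying well above the median (57000) is consistent with the positive skewness reported in the same table.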
| Value | Count | Frequency (%) |
| 60000 | 154 | 3.9% |
| 50000 | 149 | 3.8% |
| 75000 | 120 | 3.0% |
| 40000 | 120 | 3.0% |
| 45000 | 114 | 2.9% |
| 70000 | 96 | 2.4% |
| 30000 | 93 | 2.4% |
| 80000 | 93 | 2.4% |
| 65000 | 88 | 2.2% |
| 35000 | 82 | 2.1% |
| Other values (803) | 2844 | 71.9% |
| Value | Count | Frequency (%) |
| 8280 | 1 | < 0.1% |
| 8400 | 1 | < 0.1% |
| 9600 | 1 | < 0.1% |
| 9960 | 1 | < 0.1% |
| 10000 | 1 | < 0.1% |
| 11000 | 1 | < 0.1% |
| 11340 | 1 | < 0.1% |
| 11820 | 1 | < 0.1% |
| 12000 | 8 | 0.2% |
| 12252 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 550000 | 1 | < 0.1% |
| 525000 | 1 | < 0.1% |
| 408000 | 1 | < 0.1% |
| 400000 | 2 | 0.1% |
| 365000 | 1 | < 0.1% |
| 350000 | 1 | < 0.1% |
| 325000 | 1 | < 0.1% |
| 300000 | 5 | 0.1% |
| 290000 | 1 | < 0.1% |
| 281000 | 1 | < 0.1% |
Verification Status
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 31.0 KiB |
| Verified | |
|---|---|
| Not Verified | |
| Source Verified |
Length
| Max length | 15 |
|---|---|
| Median length | 12 |
| Mean length | 11.37085758 |
| Min length | 8 |
Characters and Unicode
| Total characters | 44949 |
|---|---|
| Distinct characters | 13 |
| Distinct categories | 3 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
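The character statistics in this table can be reproduced from the value counts alone. The following is a minimal sketch, assuming the three values and counts listed in the Common Values table for this variable; it uses Python's standard `unicodedata` module to assign each character its Unicode general category. The variable names are illustrative and this is not necessarily how the report itself computes the figures.

```python
import unicodedata
from collections import Counter

# Value counts as listed in the Common Values table for this variable
value_counts = {"Verified": 1515, "Not Verified": 1247, "Source Verified": 1191}

# Count every character, weighted by how often its value occurs
char_counts = Counter()
for value, count in value_counts.items():
    for ch in value:
        char_counts[ch] += count

total_characters = sum(char_counts.values())
distinct_characters = len(char_counts)
mean_length = total_characters / sum(value_counts.values())

# Group characters by Unicode general category (Lu, Ll, Zs, ...)
category_counts = Counter()
for ch, count in char_counts.items():
    category_counts[unicodedata.category(ch)] += count

print(total_characters, distinct_characters, round(mean_length, 4))
print(dict(category_counts))
```

With these counts the sketch gives 44949 total characters, 13 distinct characters and three categories (`Ll`, `Lu`, `Zs`), in line with the table above.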
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Verified |
|---|---|
| 2nd row | Source Verified |
| 3rd row | Not Verified |
| 4th row | Source Verified |
| 5th row | Source Verified |
Common Values
| Value | Count | Frequency (%) |
| Verified | 1515 | 38.3% |
| Not Verified | 1247 | 31.5% |
| Source Verified | 1191 | 30.1% |
Length
Pie chart
| Value | Count | Frequency (%) |
| verified | 3953 | |
| not | 1247 | 19.5% |
| source | 1191 | 18.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 9097 | |
| i | 7906 | |
| r | 5144 | |
| V | 3953 | |
| f | 3953 | |
| d | 3953 | |
| o | 2438 | 5.4% |
| (space) | 2438 | 5.4% |
| N | 1247 | 2.8% |
| t | 1247 | 2.8% |
| Other values (3) | 3573 | 7.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 36120 | |
| Uppercase Letter | 6391 | 14.2% |
| Space Separator | 2438 | 5.4% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 9097 | |
| i | 7906 | |
| r | 5144 | |
| f | 3953 | |
| d | 3953 | |
| o | 2438 | 6.7% |
| t | 1247 | 3.5% |
| u | 1191 | 3.3% |
| c | 1191 | 3.3% |
Uppercase Letter
| Value | Count | Frequency (%) |
| V | 3953 | |
| N | 1247 | 19.5% |
| S | 1191 | 18.6% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 2438 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 42511 | |
| Common | 2438 | 5.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 9097 | |
| i | 7906 | |
| r | 5144 | |
| V | 3953 | |
| f | 3953 | |
| d | 3953 | |
| o | 2438 | 5.7% |
| N | 1247 | 2.9% |
| t | 1247 | 2.9% |
| S | 1191 | 2.8% |
| Other values (2) | 2382 | 5.6% |
Common
| Value | Count | Frequency (%) |
| (space) | 2438 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 44949 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 9097 | |
| i | 7906 | |
| r | 5144 | |
| V | 3953 | |
| f | 3953 | |
| d | 3953 | |
| o | 2438 | 5.4% |
| (space) | 2438 | 5.4% |
| N | 1247 | 2.8% |
| t | 1247 | 2.8% |
| Other values (3) | 3573 | 7.9% |
Loan Writeoff
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 31.0 KiB |
| 0 | |
|---|---|
| 1 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 3953 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 1 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 3275 | 82.8% |
| 1 | 678 | 17.2% |
Length
Pie chart
| Value | Count | Frequency (%) |
| 0 | 3275 | |
| 1 | 678 | 17.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 3275 | |
| 1 | 678 | 17.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 3953 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 3275 | |
| 1 | 678 | 17.2% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 3953 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 3275 | |
| 1 | 678 | 17.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3953 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 3275 | |
| 1 | 678 | 17.2% |
PURPOSE
Categorical
| Distinct | 13 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 31.0 KiB |
| debt_consolidation | |
|---|---|
| credit_card | |
| other | |
| home_improvement | 196 |
| small_business | 145 |
| Other values (8) |
Length
| Max length | 18 |
|---|---|
| Median length | 18 |
| Mean length | 14.28307614 |
| Min length | 3 |
Characters and Unicode
| Total characters | 56461 |
|---|---|
| Distinct characters | 22 |
| Distinct categories | 2 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | credit_card |
|---|---|
| 2nd row | car |
| 3rd row | small_business |
| 4th row | other |
| 5th row | other |
Common Values
| Value | Count | Frequency (%) |
| debt_consolidation | 2102 | 53.2% |
| credit_card | 792 | 20.0% |
| other | 297 | 7.5% |
| home_improvement | 196 | 5.0% |
| small_business | 145 | 3.7% |
| major_purchase | 100 | 2.5% |
| car | 90 | 2.3% |
| wedding | 63 | 1.6% |
| medical | 52 | 1.3% |
| moving | 39 | 1.0% |
| Other values (3) | 77 | 1.9% |
Length
| Value | Count | Frequency (%) |
| debt_consolidation | 2102 | |
| credit_card | 792 | 20.0% |
| other | 297 | 7.5% |
| home_improvement | 196 | 5.0% |
| small_business | 145 | 3.7% |
| major_purchase | 100 | 2.5% |
| car | 90 | 2.3% |
| wedding | 63 | 1.6% |
| medical | 52 | 1.3% |
| moving | 39 | 1.0% |
| Other values (3) | 77 | 1.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 7205 | |
| d | 5966 | |
| i | 5525 | |
| t | 5523 | |
| n | 4693 | |
| e | 4206 | |
| c | 3962 | |
| a | 3455 | 6.1% |
| _ | 3341 | 5.9% |
| s | 2819 | 5.0% |
| Other values (12) | 9766 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 53120 | |
| Connector Punctuation | 3341 | 5.9% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 7205 | |
| d | 5966 | |
| i | 5525 | |
| t | 5523 | |
| n | 4693 | |
| e | 4206 | |
| c | 3962 | |
| a | 3455 | |
| s | 2819 | 5.3% |
| l | 2450 | 4.6% |
| Other values (11) | 7316 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 3341 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 53120 | |
| Common | 3341 | 5.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 7205 | |
| d | 5966 | |
| i | 5525 | |
| t | 5523 | |
| n | 4693 | |
| e | 4206 | |
| c | 3962 | |
| a | 3455 | |
| s | 2819 | 5.3% |
| l | 2450 | 4.6% |
| Other values (11) | 7316 |
Common
| Value | Count | Frequency (%) |
| _ | 3341 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 56461 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| o | 7205 | |
| d | 5966 | |
| i | 5525 | |
| t | 5523 | |
| n | 4693 | |
| e | 4206 | |
| c | 3962 | |
| a | 3455 | 6.1% |
| _ | 3341 | 5.9% |
| s | 2819 | 5.0% |
| Other values (12) | 9766 |
Zip Code
Categorical
| Distinct | 615 |
|---|---|
| Distinct (%) | 15.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 31.0 KiB |
| 900xx | 55 |
|---|---|
| 606xx | 55 |
| 100xx | 54 |
| 112xx | 50 |
| 945xx | 49 |
| Other values (610) |
Length
| Max length | 5 |
|---|---|
| Median length | 5 |
| Mean length | 5 |
| Min length | 5 |
Characters and Unicode
| Total characters | 19765 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 2 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 148 |
|---|---|
| Unique (%) | 3.7% |
Sample
| 1st row | 860xx |
|---|---|
| 2nd row | 309xx |
| 3rd row | 606xx |
| 4th row | 917xx |
| 5th row | 972xx |
Common Values
| Value | Count | Frequency (%) |
| 900xx | 55 | 1.4% |
| 606xx | 55 | 1.4% |
| 100xx | 54 | 1.4% |
| 112xx | 50 | 1.3% |
| 945xx | 49 | 1.2% |
| 070xx | 45 | 1.1% |
| 331xx | 44 | 1.1% |
| 750xx | 41 | 1.0% |
| 300xx | 41 | 1.0% |
| 113xx | 40 | 1.0% |
| Other values (605) | 3479 | 88.0% |
Length
| Value | Count | Frequency (%) |
| 900xx | 55 | 1.4% |
| 606xx | 55 | 1.4% |
| 100xx | 54 | 1.4% |
| 112xx | 50 | 1.3% |
| 945xx | 49 | 1.2% |
| 070xx | 45 | 1.1% |
| 331xx | 44 | 1.1% |
| 750xx | 41 | 1.0% |
| 300xx | 41 | 1.0% |
| 113xx | 40 | 1.0% |
| Other values (605) | 3479 |
Most occurring characters
| Value | Count | Frequency (%) |
| x | 7906 | |
| 0 | 1903 | 9.6% |
| 1 | 1535 | 7.8% |
| 9 | 1309 | 6.6% |
| 2 | 1309 | 6.6% |
| 3 | 1269 | 6.4% |
| 7 | 1023 | 5.2% |
| 5 | 924 | 4.7% |
| 4 | 914 | 4.6% |
| 8 | 855 | 4.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 11859 | |
| Lowercase Letter | 7906 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 1903 | |
| 1 | 1535 | |
| 9 | 1309 | |
| 2 | 1309 | |
| 3 | 1269 | |
| 7 | 1023 | |
| 5 | 924 | |
| 4 | 914 | |
| 8 | 855 | |
| 6 | 818 |
Lowercase Letter
| Value | Count | Frequency (%) |
| x | 7906 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 11859 | |
| Latin | 7906 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 1903 | |
| 1 | 1535 | |
| 9 | 1309 | |
| 2 | 1309 | |
| 3 | 1269 | |
| 7 | 1023 | |
| 5 | 924 | |
| 4 | 914 | |
| 8 | 855 | |
| 6 | 818 |
Latin
| Value | Count | Frequency (%) |
| x | 7906 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 19765 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| x | 7906 | |
| 0 | 1903 | 9.6% |
| 1 | 1535 | 7.8% |
| 9 | 1309 | 6.6% |
| 2 | 1309 | 6.6% |
| 3 | 1269 | 6.4% |
| 7 | 1023 | 5.2% |
| 5 | 924 | 4.7% |
| 4 | 914 | 4.6% |
| 8 | 855 | 4.3% |
Add State
Categorical
| Distinct | 43 |
|---|---|
| Distinct (%) | 1.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 31.0 KiB |
| CA | |
|---|---|
| NY | |
| FL | |
| TX | |
| NJ | 181 |
| Other values (38) |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Characters and Unicode
| Total characters | 7906 |
|---|---|
| Distinct characters | 24 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | AZ |
|---|---|
| 2nd row | GA |
| 3rd row | IL |
| 4th row | CA |
| 5th row | OR |
Common Values
| Value | Count | Frequency (%) |
| CA | 729 | 18.4% |
| NY | 372 | 9.4% |
| FL | 304 | 7.7% |
| TX | 273 | 6.9% |
| NJ | 181 | 4.6% |
| IL | 155 | 3.9% |
| GA | 146 | 3.7% |
| PA | 136 | 3.4% |
| VA | 130 | 3.3% |
| OH | 124 | 3.1% |
| Other values (33) | 1403 | 35.5% |
Length
| Value | Count | Frequency (%) |
| ca | 729 | |
| ny | 372 | 9.4% |
| fl | 304 | 7.7% |
| tx | 273 | 6.9% |
| nj | 181 | 4.6% |
| il | 155 | 3.9% |
| ga | 146 | 3.7% |
| pa | 136 | 3.4% |
| va | 130 | 3.3% |
| oh | 124 | 3.1% |
| Other values (33) | 1403 |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 1557 | |
| C | 1032 | |
| N | 803 | |
| L | 537 | 6.8% |
| M | 413 | 5.2% |
| Y | 407 | 5.1% |
| T | 394 | 5.0% |
| O | 353 | 4.5% |
| I | 309 | 3.9% |
| F | 304 | 3.8% |
| Other values (14) | 1797 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 7906 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 1557 | |
| C | 1032 | |
| N | 803 | |
| L | 537 | 6.8% |
| M | 413 | 5.2% |
| Y | 407 | 5.1% |
| T | 394 | 5.0% |
| O | 353 | 4.5% |
| I | 309 | 3.9% |
| F | 304 | 3.8% |
| Other values (14) | 1797 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 7906 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 1557 | |
| C | 1032 | |
| N | 803 | |
| L | 537 | 6.8% |
| M | 413 | 5.2% |
| Y | 407 | 5.1% |
| T | 394 | 5.0% |
| O | 353 | 4.5% |
| I | 309 | 3.9% |
| F | 304 | 3.8% |
| Other values (14) | 1797 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 7906 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 1557 | |
| C | 1032 | |
| N | 803 | |
| L | 537 | 6.8% |
| M | 413 | 5.2% |
| Y | 407 | 5.1% |
| T | 394 | 5.0% |
| O | 353 | 4.5% |
| I | 309 | 3.9% |
| F | 304 | 3.8% |
| Other values (14) | 1797 |
DTI
Real number (ℝ≥0)
| Distinct | 1961 |
|---|---|
| Distinct (%) | 49.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 14.42828738 |
| Minimum | 0 |
|---|---|
| Maximum | 29.85 |
| Zeros | 3 |
| Zeros (%) | 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 31.0 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 3.932 |
| Q1 | 9.58 |
| median | 14.45 |
| Q3 | 19.47 |
| 95-th percentile | 24.214 |
| Maximum | 29.85 |
| Range | 29.85 |
| Interquartile range (IQR) | 9.89 |
Descriptive statistics
| Standard deviation | 6.378445753 |
|---|---|
| Coefficient of variation (CV) | 0.4420792008 |
| Kurtosis | -0.7703420751 |
| Mean | 14.42828738 |
| Median Absolute Deviation (MAD) | 4.94 |
| Skewness | -0.04903565752 |
| Sum | 57035.02 |
| Variance | 40.68457022 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 11.8 | 9 | 0.2% |
| 18.63 | 8 | 0.2% |
| 20.88 | 8 | 0.2% |
| 12.48 | 7 | 0.2% |
| 9.65 | 7 | 0.2% |
| 17.67 | 7 | 0.2% |
| 19.63 | 7 | 0.2% |
| 16.4 | 7 | 0.2% |
| 16.2 | 7 | 0.2% |
| 18.84 | 7 | 0.2% |
| Other values (1951) | 3879 | 98.1% |
| Value | Count | Frequency (%) |
| 0 | 3 | 0.1% |
| 0.02 | 2 | 0.1% |
| 0.07 | 1 | < 0.1% |
| 0.2 | 1 | < 0.1% |
| 0.25 | 1 | < 0.1% |
| 0.32 | 2 | 0.1% |
| 0.34 | 1 | < 0.1% |
| 0.41 | 1 | < 0.1% |
| 0.55 | 1 | < 0.1% |
| 0.57 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 29.85 | 1 | < 0.1% |
| 29.83 | 1 | < 0.1% |
| 29.73 | 1 | < 0.1% |
| 29.72 | 1 | < 0.1% |
| 29.63 | 1 | < 0.1% |
| 29.44 | 2 | 0.1% |
| 29.36 | 1 | < 0.1% |
| 29.35 | 1 | < 0.1% |
| 29.29 | 1 | < 0.1% |
| 29.26 | 1 | < 0.1% |
Delinq 2Yrs
Real number (ℝ≥0)
| Distinct | 6 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.1085251708 |
| Minimum | 0 |
|---|---|
| Maximum | 6 |
| Zeros | 3628 |
| Zeros (%) | 91.8% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 31.0 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 1 |
| Maximum | 6 |
| Range | 6 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.4087983222 |
|---|---|
| Coefficient of variation (CV) | 3.766852606 |
| Kurtosis | 32.99870086 |
| Mean | 0.1085251708 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 4.954297207 |
| Sum | 429 |
| Variance | 0.1671160683 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 3628 | 91.8% |
| 1 | 246 | 6.2% |
| 2 | 61 | 1.5% |
| 3 | 13 | 0.3% |
| 4 | 4 | 0.1% |
| 6 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 3628 | 91.8% |
| 1 | 246 | 6.2% |
| 2 | 61 | 1.5% |
| 3 | 13 | 0.3% |
| 4 | 4 | 0.1% |
| 6 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 6 | 1 | < 0.1% |
| 4 | 4 | 0.1% |
| 3 | 13 | 0.3% |
| 2 | 61 | 1.5% |
| 1 | 246 | 6.2% |
| 0 | 3628 | 91.8% |
Inq Last 6Mths
Real number (ℝ≥0)
| Distinct | 9 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.8555527448 |
| Minimum | 0 |
|---|---|
| Maximum | 8 |
| Zeros | 1822 |
| Zeros (%) | 46.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 31.0 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 1 |
| Q3 | 1 |
| 95-th percentile | 3 |
| Maximum | 8 |
| Range | 8 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 0.997025005 |
|---|---|
| Coefficient of variation (CV) | 1.165357731 |
| Kurtosis | 2.163689287 |
| Mean | 0.8555527448 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 1.26526022 |
| Sum | 3382 |
| Variance | 0.9940588606 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 1822 | 46.1% |
| 1 | 1245 | 31.5% |
| 2 | 584 | 14.8% |
| 3 | 265 | 6.7% |
| 4 | 21 | 0.5% |
| 5 | 10 | 0.3% |
| 6 | 3 | 0.1% |
| 7 | 2 | 0.1% |
| 8 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 1822 | 46.1% |
| 1 | 1245 | 31.5% |
| 2 | 584 | 14.8% |
| 3 | 265 | 6.7% |
| 4 | 21 | 0.5% |
| 5 | 10 | 0.3% |
| 6 | 3 | 0.1% |
| 7 | 2 | 0.1% |
| 8 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 8 | 1 | < 0.1% |
| 7 | 2 | 0.1% |
| 6 | 3 | 0.1% |
| 5 | 10 | 0.3% |
| 4 | 21 | 0.5% |
| 3 | 265 | 6.7% |
| 2 | 584 | 14.8% |
| 1 | 1245 | 31.5% |
| 0 | 1822 | 46.1% |
Pub Rec
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 31.0 KiB |
| 0 | |
|---|---|
| 1 | 120 |
| 2 | 2 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 3953 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 3831 | 96.9% |
| 1 | 120 | 3.0% |
| 2 | 2 | 0.1% |
Length
Pie chart
| Value | Count | Frequency (%) |
| 0 | 3831 | |
| 1 | 120 | 3.0% |
| 2 | 2 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 3831 | |
| 1 | 120 | 3.0% |
| 2 | 2 | 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 3953 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 3831 | |
| 1 | 120 | 3.0% |
| 2 | 2 | 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 3953 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 3831 | |
| 1 | 120 | 3.0% |
| 2 | 2 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3953 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 3831 | |
| 1 | 120 | 3.0% |
| 2 | 2 | 0.1% |
Revol Bal
Real number (ℝ≥0)
| Distinct | 3672 |
|---|---|
| Distinct (%) | 92.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 14367.44751 |
| Minimum | 0 |
|---|---|
| Maximum | 140967 |
| Zeros | 42 |
| Zeros (%) | 1.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 31.0 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1240.4 |
| Q1 | 6352 |
| median | 11449 |
| Q3 | 18151 |
| 95-th percentile | 35148.4 |
| Maximum | 140967 |
| Range | 140967 |
| Interquartile range (IQR) | 11799 |
Descriptive statistics
| Standard deviation | 13468.63453 |
|---|---|
| Coefficient of variation (CV) | 0.937441012 |
| Kurtosis | 18.01764983 |
| Mean | 14367.44751 |
| Median Absolute Deviation (MAD) | 5657 |
| Skewness | 3.322035836 |
| Sum | 56794520 |
| Variance | 181404116.1 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 42 | 1.1% |
| 8032 | 3 | 0.1% |
| 13034 | 3 | 0.1% |
| 14848 | 3 | 0.1% |
| 10980 | 3 | 0.1% |
| 6565 | 3 | 0.1% |
| 15183 | 3 | 0.1% |
| 11338 | 3 | 0.1% |
| 18467 | 3 | 0.1% |
| 8357 | 3 | 0.1% |
| Other values (3662) | 3884 | 98.3% |
| Value | Count | Frequency (%) |
| 0 | 42 | 1.1% |
| 3 | 1 | < 0.1% |
| 6 | 1 | < 0.1% |
| 8 | 1 | < 0.1% |
| 16 | 1 | < 0.1% |
| 25 | 1 | < 0.1% |
| 33 | 1 | < 0.1% |
| 41 | 2 | 0.1% |
| 50 | 1 | < 0.1% |
| 62 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 140967 | 1 | < 0.1% |
| 131949 | 1 | < 0.1% |
| 130920 | 1 | < 0.1% |
| 124744 | 1 | < 0.1% |
| 123416 | 1 | < 0.1% |
| 120504 | 1 | < 0.1% |
| 112522 | 1 | < 0.1% |
| 110856 | 1 | < 0.1% |
| 108339 | 1 | < 0.1% |
| 106406 | 1 | < 0.1% |
Total Paymnt
Real number (ℝ≥0)
| Distinct | 3710 |
|---|---|
| Distinct (%) | 93.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 14435.06432 |
| Minimum | 0 |
|---|---|
| Maximum | 58886.47343 |
| Zeros | 2 |
| Zeros (%) | 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 31.0 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 2401.064047 |
| Q1 | 6614.78722 |
| median | 11907.35 |
| Q3 | 19190.68001 |
| 95-th percentile | 35788.92425 |
| Maximum | 58886.47343 |
| Range | 58886.47343 |
| Interquartile range (IQR) | 12575.89279 |
Descriptive statistics
| Standard deviation | 10492.53033 |
|---|---|
| Coefficient of variation (CV) | 0.7268779753 |
| Kurtosis | 1.593830926 |
| Mean | 14435.06432 |
| Median Absolute Deviation (MAD) | 5937.176941 |
| Skewness | 1.261678967 |
| Sum | 57061809.25 |
| Variance | 110093192.6 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 14288.76169 | 8 | 0.2% |
| 13148.13786 | 7 | 0.2% |
| 11907.34732 | 7 | 0.2% |
| 12029.45 | 7 | 0.2% |
| 11600.98 | 6 | 0.2% |
| 10956.77596 | 5 | 0.1% |
| 9011.557494 | 5 | 0.1% |
| 14288.77 | 5 | 0.1% |
| 11726.32 | 5 | 0.1% |
| 13263.96 | 5 | 0.1% |
| Other values (3700) | 3893 | 98.5% |
| Value | Count | Frequency (%) |
| 0 | 2 | 0.1% |
| 91.39 | 1 | < 0.1% |
| 151.8 | 1 | < 0.1% |
| 165.37 | 1 | < 0.1% |
| 203.55 | 1 | < 0.1% |
| 258.46 | 1 | < 0.1% |
| 262.7 | 1 | < 0.1% |
| 309.36 | 1 | < 0.1% |
| 328.01 | 1 | < 0.1% |
| 331.83 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 58886.47343 | 1 | < 0.1% |
| 58133.3199 | 1 | < 0.1% |
| 58090.95207 | 1 | < 0.1% |
| 58071.19982 | 1 | < 0.1% |
| 58071.19977 | 1 | < 0.1% |
| 57997.27995 | 1 | < 0.1% |
| 57143.25996 | 1 | < 0.1% |
| 57117.89995 | 1 | < 0.1% |
| 56681.8859 | 1 | < 0.1% |
| 56681.88585 | 1 | < 0.1% |
Pearson's r
Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. Its value lies between -1 and +1: -1 indicates total negative linear correlation, 0 indicates no linear correlation, and +1 indicates total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, which implies that, for a linear relationship, the steepness of the line does not affect r. To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
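As a minimal sketch of this calculation, the snippet below computes r in plain Python for a handful of (Loan Amnt, INSTALLMENT) pairs taken from the 36-month loans in the first rows of this dataset. The function name `pearson_r` is illustrative, and five rows are far too few to reproduce the coefficients in the report; the example only shows the formula and the invariance under shifting and rescaling.

```python
import math

def pearson_r(x, y):
    """Covariance of x and y divided by the product of their standard deviations."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    # The 1/n factors cancel between numerator and denominator, so sums suffice.
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / math.sqrt(var_x * var_y)

# Loan Amnt and INSTALLMENT of the 36-month loans among the first rows
loan_amnt = [5000, 2400, 10000, 5000, 3000]
installment = [162.87, 84.33, 339.31, 156.46, 109.43]

r = pearson_r(loan_amnt, installment)

# Shifting and rescaling one variable leaves r unchanged
r_rescaled = pearson_r([2 * v + 100 for v in loan_amnt], installment)

print(round(r, 3), round(r_rescaled, 3))
```

For this small sample r comes out close to +1, which fits the high correlation the report flags between these two fields.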
Spearman's ρ
Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better at capturing nonlinear monotonic correlations than Pearson's r. Its value lies between -1 and +1: -1 indicates total negative monotonic correlation, 0 indicates no monotonic correlation, and +1 indicates total positive monotonic correlation. To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
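The rank-based definition can be sketched in plain Python as follows: rank both variables (tied values share the mean of their positions) and apply the Pearson formula to the ranks. The helper names and the toy data are illustrative and not taken from the report.

```python
import math

def average_ranks(values):
    """Rank values from 1 to n; tied values receive the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with position i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks

def pearson_r(x, y):
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / math.sqrt(var_x * var_y)

def spearman_rho(x, y):
    """Pearson correlation of the rank variables of x and y."""
    return pearson_r(average_ranks(x), average_ranks(y))

# A monotonic but nonlinear relationship: y = x cubed
x = [1, 2, 3, 4, 5]
y = [v ** 3 for v in x]

rho = spearman_rho(x, y)
r = pearson_r(x, y)
print(round(rho, 3), round(r, 3))
```

Because the relationship is perfectly monotonic, ρ reaches 1, while Pearson's r on the raw values stays below 1.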
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1: -1 indicates total negative correlation, 0 indicates no correlation, and +1 indicates total positive correlation. To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
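The pair-counting definition translates directly into code. The sketch below implements exactly the formula stated above (often called τ-a); library implementations commonly apply an additional correction for ties, so their results can differ when tied values are present. The function name and the toy data are illustrative.

```python
from itertools import combinations

def kendall_tau(x, y):
    """(concordant pairs - discordant pairs) / total number of pairs."""
    concordant = 0
    discordant = 0
    pairs = list(combinations(range(len(x)), 2))
    for i, j in pairs:
        # Positive product: both variables order the pair the same way
        sign = (x[i] - x[j]) * (y[i] - y[j])
        if sign > 0:
            concordant += 1
        elif sign < 0:
            discordant += 1
        # Tied pairs (sign == 0) count as neither concordant nor discordant
    return (concordant - discordant) / len(pairs)

x = [1, 2, 3, 4]
y = [1, 3, 2, 4]  # only the pair (2, 3) is ordered differently

tau = kendall_tau(x, y)
print(round(tau, 3))
```

Here five of the six pairs are concordant and one is discordant, so τ = (5 - 1) / 6.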
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency, and reverts to the Pearson correlation coefficient in the case of a bivariate normal input distribution. Extensive documentation is available from the Phik project.
Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been shown to be biased, even for large samples. We use the bias-corrected measure proposed by Bergsma in 2013.
First rows
| | Name | Email ID | Gender | Dt_Applied | University | Loan Amnt | Funded amnt inv | TERM | Int Rate | INSTALLMENT | GRADE | Sub Grade | Home Ownership | Annual Inc | Verification Status | Loan Writeoff | PURPOSE | Zip Code | Add State | DTI | Delinq 2Yrs | Inq Last 6Mths | Pub Rec | Revol Bal | Total Paymnt |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | Calley Giron | cgiron0@ehow.com | Female | 01/01/81 | Warner Southern College | 5000 | 4975.0 | 36 months | 0.107 | 162.87 | B | B2 | RENT | 24000.0 | Verified | 0 | credit_card | 860xx | AZ | 27.65 | 0 | 1 | 0 | 13648 | 5863.155187 |
| 1 | Linus Stud | lstud1@washington.edu | Male | 02/01/81 | Shri Lal Bahadur Shastri Rashtriya Sanskrit Vidyapeetha | 2500 | 2500.0 | 60 months | 0.153 | 59.83 | C | C4 | RENT | 30000.0 | Source Verified | 1 | car | 309xx | GA | 1.00 | 0 | 5 | 0 | 1687 | 1014.530000 |
| 2 | Lorelle Ambage | lambage2@wix.com | Female | 03/01/81 | Technische Universität Bergakademie Freiberg | 2400 | 2400.0 | 36 months | 0.160 | 84.33 | C | C5 | RENT | 12252.0 | Not Verified | 0 | small_business | 606xx | IL | 8.72 | 0 | 2 | 0 | 2956 | 3005.666844 |
| 3 | Anna-diane Larrat | alarrat3@economist.com | Female | 04/01/81 | Divine Word College of Legazpi | 10000 | 10000.0 | 36 months | 0.135 | 339.31 | C | C1 | RENT | 49200.0 | Source Verified | 0 | other | 917xx | CA | 20.00 | 0 | 1 | 0 | 5598 | 12231.890000 |
| 4 | Gill Ruske | NaN | Female | 05/01/81 | East China Jiao Tong University | 3000 | 3000.0 | 60 months | 0.127 | 67.79 | B | B5 | RENT | 80000.0 | Source Verified | 0 | other | 972xx | OR | 17.94 | 0 | 0 | 0 | 27783 | 4066.908161 |
| 5 | Evelyn MacFaul | emacfaul5@theatlantic.com | Female | 06/01/81 | Ahmedabad University | 5000 | 5000.0 | 36 months | 0.079 | 156.46 | A | A4 | RENT | 36000.0 | Source Verified | 0 | wedding | 852xx | AZ | 11.20 | 0 | 3 | 0 | 7963 | 5632.210000 |
| 6 | Ainslie Rainard | arainard6@virginia.edu | Female | 07/01/81 | NaN | 7000 | 7000.0 | 60 months | 0.160 | 170.08 | C | C5 | RENT | 47004.0 | Not Verified | 0 | debt_consolidation | 280xx | NC | 23.51 | 0 | 1 | 0 | 17726 | 10137.840010 |
| 7 | Emmott Hamby | ehamby7@prnewswire.com | Male | 08/01/81 | Institute of Business Management | 3000 | 3000.0 | 36 months | 0.186 | 109.43 | E | E1 | RENT | 48000.0 | Source Verified | 0 | car | 900xx | CA | 5.35 | 0 | 2 | 0 | 8221 | 3939.135294 |
| 8 | Shem Toomer | stoomer8@home.pl | Male | 09/01/81 | Osaka University of Education | 5600 | 5600.0 | 60 months | 0.213 | 152.39 | F | F2 | OWN | 40000.0 | Source Verified | 1 | small_business | 958xx | CA | 5.55 | 0 | 2 | 0 | 5210 | 647.500000 |
| 9 | Giana Aberhart | gaberhart9@mozilla.com | Female | 10/01/81 | American Public University | 5375 | 5350.0 | 60 months | 0.127 | 121.45 | B | B5 | RENT | 15000.0 | Verified | 1 | other | 774xx | TX | 18.08 | 0 | 0 | 0 | 9279 | 1484.590000 |
Last rows
| | Name | Email ID | Gender | Dt_Applied | University | Loan Amnt | Funded amnt inv | TERM | Int Rate | INSTALLMENT | GRADE | Sub Grade | Home Ownership | Annual Inc | Verification Status | Loan Writeoff | PURPOSE | Zip Code | Add State | DTI | Delinq 2Yrs | Inq Last 6Mths | Pub Rec | Revol Bal | Total Paymnt |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 3943 | Merla Thebe | mthebeq7@cocolog-nifty.com | Female | 21/10/91 | North Eastern Hill University | 6000 | 6000.0 | 36 months | 0.163 | 211.81 | D | D1 | RENT | 39564.0 | Verified | 1 | debt_consolidation | 606xx | IL | 23.78 | 2 | 1 | 0 | 2028 | 3388.960000 |
| 3944 | Marcellina Dinneges | mdinnegesq8@infoseek.co.jp | Female | 22/10/91 | Universidade Católica de Santos | 2400 | 2400.0 | 36 months | 0.117 | 79.39 | B | B3 | RENT | 39800.0 | Not Verified | 0 | other | 303xx | GA | 14.32 | 0 | 0 | 0 | 15497 | 2836.660516 |
| 3945 | Way Symonds | wsymondsq9@mlb.com | Male | 23/10/91 | American International University West Africa | 25000 | 25000.0 | 60 months | 0.183 | 638.25 | D | D5 | MORTGAGE | 156000.0 | Source Verified | 0 | house | 944xx | CA | 5.85 | 0 | 0 | 0 | 10709 | 37936.750000 |
| 3946 | Ailene Matejka | NaN | Female | 24/10/91 | Kaya University | 20000 | 20000.0 | 36 months | 0.117 | 661.52 | B | B3 | RENT | 80700.0 | Verified | 0 | debt_consolidation | 946xx | CA | 13.67 | 0 | 1 | 0 | 7211 | 23406.523000 |
| 3947 | Samuel Overel | NaN | Male | 25/10/91 | Northwestern University | 12000 | 12000.0 | 60 months | 0.183 | 306.36 | D | D5 | MORTGAGE | 34000.0 | Not Verified | 1 | debt_consolidation | 177xx | PA | 12.56 | 0 | 0 | 0 | 6114 | 9667.950000 |
| 3948 | Corbie Creeboe | ccreeboeqc@sitemeter.com | Male | 26/10/91 | Shaheed Rajaei Teacher Training University | 12000 | 12000.0 | 36 months | 0.135 | 407.17 | C | C1 | RENT | 125000.0 | Source Verified | 0 | wedding | 086xx | NJ | 13.18 | 0 | 1 | 0 | 46286 | 14657.917650 |
| 3949 | Bobbe Ochterlonie | bochterlonieqd@ezinearticles.com | Female | 27/10/91 | Dhofar University | 15000 | 15000.0 | 36 months | 0.124 | 501.23 | B | B4 | RENT | 72000.0 | Verified | 0 | debt_consolidation | 104xx | NY | 7.47 | 0 | 1 | 0 | 12147 | 16729.253640 |
| 3950 | Corella Esposito | cespositoqe@macromedia.com | Female | 28/10/91 | University of Jan Evangelista Purkyne | 12000 | 12000.0 | 36 months | 0.060 | 365.23 | A | A1 | OWN | 48000.0 | Not Verified | 0 | debt_consolidation | 365xx | AL | 23.35 | 0 | 0 | 0 | 22385 | 13148.137860 |
| 3951 | Prince Dibdin | pdibdinqf@businessinsider.com | Male | 29/10/91 | College in Sládkovičovo | 15000 | 15000.0 | 60 months | 0.160 | 364.46 | C | C5 | RENT | 50000.0 | Verified | 1 | debt_consolidation | 907xx | CA | 18.26 | 0 | 1 | 0 | 9799 | 10883.540000 |
| 3952 | Georgette Warratt | gwarrattqg@java.com | Female | 30/10/91 | Technical University of Lublin | 15000 | 14975.0 | 60 months | 0.153 | 358.98 | C | C4 | MORTGAGE | 32976.0 | Not Verified | 1 | debt_consolidation | 177xx | PA | 17.90 | 0 | 1 | 0 | 7956 | 11704.260000 |